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Foreword 


There is incredible interest and energy today in addressing issues of human capital in K-12 education, 
especially in the way we prepare, evaluate, pay, and manage teachers. States have been developing and 
implementing systems intended to improve these practices, with a considerable push from foundations 
and the federal government. 

As we start to rethink outdated tenure, evaluation, and pay systems, we must take care to respect 
how uncertain our efforts are and avoid tying our hands in ways that we will regret in the decade ahead. 
Well-intentioned legislators too readily replace old credential- and paper-based micromanagement with 
mandates that rely heavily on still-nascent observational evaluations and student outcome measurements 
that pose as many questions as answers. The flood of new legislative activity is in many respects wel- 
come, but it does pose a risk that premature solutions and imperfect metrics are being cemented into 
difficult-to-change statutes. 

AEI’s Teacher Quality 2.0 series seeks to reinvigorate our now-familiar conversations about teacher 
quality by looking at today’s reform efforts as constituting initial steps on a long path forward. As we 
conceptualize it, “Teacher Quality 2.0” starts from the premise that while we've made great improve- 
ments in the past ten years in creating systems and tools that allow us to evaluate, compensate, and 
deploy educators in smarter ways, we must not let today’s “reform” conventions around hiring, evalua- 
tion, or pay limit school and system leaders’ ability to adapt more promising staffing and school models. 

Before we can begin to address these issues, we have to reflect on how we got to our current posi- 
tion. In this first installment of the series, Arnold Shober, associate professor of government at Lawrence 
University, sets the historical context for this 2.0 dialogue by tracing the evolution of our conception of 
teacher quality throughout the past decade. Shober describes the changing political pressures, milestone 
policy changes, and notable research findings that have molded today’s approaches to boosting educator 
effectiveness. He argues that we have seen a shift in paradigm from trust to measurement, adjusting our 
view of teaching from something immeasurable to something we can parse and compare. Although 
these developments have created challenges of their own, they have encouraged schools and school dis- 
tricts to think about teacher effectiveness in a new way. 

I am hopeful that you will find Shober’s paper to be as enlightening and thought-provoking as I 
have in helping make sense of the road we've paved to reach our current state. For further information 
on the paper, Shober can be reached at arnold.shober@lawrence.edu. For additional information on the 
activities of AEI’s education policy program, please visit www.aei.org/hess or contact Lauren Aronson at 
lauren.aronson@aei.org. 


—FREDERICK M. HESS 
Director of Education Policy Studies 
American Enterprise Institute 
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Executive Summary 


The concept of “teacher quality” has undergone a pro- 
found transformation in the last decade. Through the late 
1990s, most policymakers assumed that educator effec- 
tiveness was immeasurable and that our only hopes to 
increase it were tied to classroom experience and aca- 
demic credentials. Yet since 2001, through a series of 
notable research findings, changing political pressures, 
and landmark policy changes, we have come to view 
teacher quality as independent of licensure and individu- 
ally measurable. We now approach evaluating the quality 
of our teachers by measuring their ongoing performance 
in the classroom. The key markers in the transformation: 


¢ Declining trust in teacher education: Data from the 
Higher Education Act of 1998 showed that teacher 
education programs graduated students of widely 


variable ability. 


Ineffective credential requirements: Despite federal 
requirements for high-quality teachers, by 2006, 


students in high-poverty schools were still more 
likely to have lower-quality teachers than their peers 
in other schools. 


Bipartisan agreement: By 2009, major federal 
politicians in both political parties, including 
Democratic politicians heavily supported by unions, 
agreed that low teacher quality was a barrier to 


high-quality education. 


¢ New era of measurement: No Child Left Behind 
created a treasure trove of individual student data 
that, though imperfect in many ways, led to new 
opportunities for research to link teachers and 
students. 


No consensus has emerged about how to improve 
teacher quality, but policymakers continue to experiment 
with teacher preparation programs, recruitment incen- 
tives, tenure provisions, and differential pay. 


From Teacher Education to Student Progress 
Teacher Quality Since NCLB 


Arnold F. Shober 


It is hard to imagine a time when teacher quality was 
not a central piece of American education policy, yet the 
meaning of “quality” has undergone a profound transfor- 
mation in the last decade. Through the late 1990s, policy- 
makers and district personnel trusted teacher credentials 
as a marker of quality in the education system, and they 
paid little attention to the variation in classroom effects 
despite the common credential. Since then, the combina- 
tion of a renewed public emphasis on improving academic 
achievement and new research on teacher effectiveness 
has prompted policymakers to question the trustworthi- 
ness of linking certification to quality. 

Equating quality with qualifications was a reasonable 
assumption before the 1990s as the learning process was 
not well understood, and an emphasis on training and 
retaining expertise was appropriate under such circum- 
stances. In effect, earning a credential through training 
meant that teachers could be trusted to make the right 
decisions on their own in an uncertain educational envi- 
ronment. Further, in the mid-twentieth century, the 
National Education Association (NEA) and the American 
Federation of Teachers (AFT) took the view that teachers 
were professionals first, who needed job security to rein- 
force public trust in their decisions, and that teachers 
could be trusted to self-regulate, claiming that “what 
teachers want is what children need.” To preserve teacher 
experience, the nation’s major teachers’ unions prevailed 
upon state legislatures to extend collective bargaining to 
them in the name of teacher quality. Later, in the 1980s, 
many states addressed teacher quality by requiring new 
teachers to have an official teacher-mentor. In their view, 
certified teachers could be trusted to do what was right 
for children without outside supervision—and, by impli- 
cation, were interchangeable—and they largely suc- 
ceeded. Once certified, one elementary teacher was as 
good as any other elementary teacher. At century's end, 
both the 1998 reauthorization of the Higher Education 
Act (HEA) and the 2001 passage of No Child Left 
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Behind (NCLB) accepted the view that quality was the 
combination of training and experience. 

But by the late 1990s, it was abundantly clear that 
this definition was fiction. Teacher qualifications did not 
guarantee teacher quality. Thirty years of data showed 
that students systematically learned more in some class- 
rooms than others and that disparities in learning could 
be tied to disparities in teacher quality. Certified teachers 
were not, as it turned out, interchangeable; they were 
individuals with strengths and weaknesses. Trusting cre- 
dentialing to produce teachers with equal abilities has 
hindered recruiting the best and brightest college gradu- 
ates into teaching, prevented schools from rewarding the 
best teachers, and hurt low-income, low-performing stu- 
dents as teachers sort themselves into more desirable 
schools. Policymakers began to look back further in the 
policy chain. If credentialing could not be trusted to pro- 
duce reliable inputs to K-12 education, then what could? 

Even as the policy problem crystallized, “boosting 
teacher quality” was not a well-defined set of policy pre- 
scriptions in 2001. Foreshadowing forty years of research, 
James Coleman's Equality of Educational Opportunity (EEO) 
reported in 1966 that “the quality of teachers shows a 
stronger relationship to pupil achievement [than school 
facilities],. . . [but] the results are not at all conclusive 
regarding the specific characteristics of teachers that are 
most important.” ’ As late as 1999, Dan Goldhaber and 
coauthors suggested that 97 percent of teacher quality 
was “intangible.” For researchers and policymakers, then, 
identifiable components of good teaching were a mystery.” 

But the lack of a clear theory of action did not stop 
policymakers from constructing a causal narrative about 
what some thought should make a good teacher: improv- 
ing teacher education and increasing teacher credentials. 
In light of the dramatic uncertainty surrounding teach- 
ing, these were some of the few items that policy could 
control. Federal policymakers adopted this approach in 
HEA and NCLB, following a path well trodden by state 
departments of education.’ But, as with earlier teacher 
quality policies, this status-quo trust of credentials for 
teacher quality was long on gut feeling and short on 


empirical evidence. Increasing funding for colleges of 
education, even with additional oversight, did not upset 
existing legislative-political alliances with universities. 
Increasing requirements for licenses tinkered with the 
front end of teachers’ careers and did not threaten teach- 
ers unions’ membership. 


Thirty years of data showed that 


students systematically learned more 


in some classrooms than others and 
that disparities in learning could be 


tied to disparities in teacher quality. 


Yet since 2001, state and federal policymakers have 
accomplished a revolution in the conceptualization of 
American teacher quality in ways that were unanticipated 
by either HEA or NCLB. The change in policy is 
twofold. First, policymakers subtly shifted the framing of 
teaching policy. Instead of treating teaching as difficult to 
parse and compare, policymakers now suggest that the 
outputs of teaching are understood well enough to quan- 
tify, measure, and manipulate them. When policymakers 
regarded credentials as a reliable input, they considered 
the autonomous decisions that teachers made in uncer- 
tain circumstances generally trustworthy. Teachers often 
supported this view by arguing that they needed to tailor 
their methods to different students’ unique needs, which 
could not be anticipated by uniform policy prescriptions. 
Thus, little effort was made to measure the value of a cre- 
dential to teachers’ performance in their classrooms. Now, 
as policymakers believe they understand the process of 
teaching and learning better, they seek to verify the value 
of teachers’ credentials by setting a minimum, predefined, 
and assessable standard of quality. NCLB aided this 
change through its emphasis on measurement, particu- 
larly on reading and math performance, and its narrow 
focus on student assessment, thereby allowing scholars 
to link teacher quality to student outputs with little lag 
time. This was a major advance from the 1990s, when 
scholars were still analyzing thirty-year-old EEO data, 
picking through data from Tennessee’s class-size-reduction 
experiment in the 1980s, and drawing weak claims about 
quality using cross-sectional administrative figures. 


Second, politicians in both major parties adopted this 
framing. More to the point, major Democratic politicians 
argued that trusting credentials to ensure student per- 
formance undercut educational equity. Thus, overwhelm- 
ingly Democratic teachers’ unions have struggled to halt 
evaluation based on student performance, differential pay, 
and alternative certification because their fellow partisans 
in government champion these policies.’ Credentials were 
meant to ensure that teachers were highly qualified as a 
group, but credentials have lost legislators’ trust. Legisla- 
tors and policymakers want to measure the value of indi- 
vidual teachers. 

As policymakers and scholars debate the reauthoriza- 
tion of the Elementary and Secondary Education Act 
(ESEA) in 2012, reconceptualizing teaching as a measura- 
ble endeavor—partnered with new evidence about effec- 
tive teaching from NCLB data—has made possible 
meaningful, long-term improvements to teacher quality 
through federal law. This essay traces the development of 
this new focus, considers potential causes for the change, 
and suggests how major teacher effectiveness proposals fit 
into this paradigm. 


From Qualifications to Quality 


Lawmakers passed the ESEA in 1965 in the interest of 
advancing equity for low-income, nonwhite children, 
primarily through supplementing local and state school 
spending with federal dollars. The focus of the act, 
implicit in the title of Coleman’s companion study, was 
creating educational opportunity, with less focus on educa- 
tional outcomes. Policymakers recognized that they had 
little evidence regarding effective practices within schools, 
but they fell back on a reasonable guess: if low-income 
and nonwhite children had access to the same resources as 
middle- and high-income white children, their academic 
performance would follow. Unfortunately for the act’s 
original supporters, evidence from the National Assess- 
ment of Educational Progress (NAEP) in the subsequent 
thirty years suggested that the academic performance of 
nonwhite and low-income children showed little improve- 
ment given the scale of federal involvement. Simultane- 
ously, a handful of state court decisions in the 1980s and 
1990s—notably Rose v. Council for Better Education 
(1989) in Kentucky—argued that untargeted spending 
would do little to overcome these educational disparities. 
The combination of ESEA’s stated concern for edu- 
cational equity and stubbornly low performance left poli- 
cymakers in a bind. Clearly, the focus on input measures 


would neither make the resources in low-income schools 
equal to their upscale counterparts nor substantively 
improve students’ academic performance. This conun- 
drum opened the window for cross partisan experimenta- 
tion and presaged the shift in the conceptualization of 
teaching. While ESEA would continue to focus on equity, 
the act’s emphasis would shift to equality of outputs. The 
1994 reauthorization of ESEA, the Improving America’s 
Schools Act (LASA), called for academic state standards, 
academic testing, and wider participation in NAEP, edg- 
ing federal concern toward results. Seven years later, 
NCLB added teeth to IASA’s requirements, but the law 
only nudged federal policy to redirect its focus toward the 
central component of classroom-based learning: teachers. 

The central role accorded teachers at the turn of the 
century marked a significant shift from 1965, when policy- 
makers believed financial resources were at the core of 
education achievement. Nonetheless, policymakers in 
1998 and 2001 had little more information about teach- 
ers’ direct influence on students than policymakers had in 
1965. Scholarly evidence available in 1998 suggested that 
strong teachers could improve student academic achieve- 
ment compared to other school factors, but the ow 
remained unclear.’ Teachers were the closest to the learn- 
ing process, and the uncertainty surrounding effective 
teaching practices encouraged policymakers to maintain a 
veil over the practice of teaching. They continued to tar- 
get only the entry points into teaching: certification and 
colleges of education. 


Teacher Quality as Teacher Education. The opening 
salvo in the fight leading from LASA to HEA and NCLB 
came from the National Commission on Teaching and 
America’s Future (NCTAF). Its 1996 report, What Mat- 
ters Most: Teaching for Americas Future, reported that 27 
percent of newly hired teachers did not have standard cre- 
dentials when beginning their work—almost half of these 
had no credentials at all. More damning was that these 
teachers were concentrated in the highest-poverty, least- 
white schools. For example, the report documented that 
only 54 percent of math teachers in predominantly non- 
white high schools had a major in their field, while 86 
percent of teachers in predominantly white schools did.” 
NCTAF also noted that the requirements for earning a 
teaching license varied widely among the states and that 
teacher preparation programs in institutions of higher 
education were uninspiring, fragmented, superficial, and 
backward-looking. In short, teacher preparation programs 
were “caught in a vicious circle of mediocre practice 
modeled after mediocre practice.”® These delinquencies 


prompted Rep. George Miller (D-CA), then the second- 
ranking Democrat on the House Committee on Educa- 
tion and the Workforce, to argue that teacher preparation 
programs were “perpetrating a fraud on the public.”® 
Among other things, the NCTAF report recom- 
mended that standards for both teachers and students be 
raised; in particular, it urged increased standards for licen- 
sure and a thorough overhaul of teacher preparation pro- 
grams. The bipartisan cast of authors who produced the 
report, including Ted Sanders, Albert Shanker, Jim Edgar, 
and James B. Hunt, gave it significant sway in the Clin- 
ton administration and the Republican-controlled Con- 
gress. President Bill Clinton and Senator Jim Jeffords 
(R-VT) echoed the report’s conclusions: the United States 
needed more teachers; those teachers needed more sub- 
ject-matter training; teacher education was divorced from 
real-world classrooms; and some traditional teacher certi- 
fication programs degraded overall teacher quality.'° 


The central role accorded to 
teachers at the turn of the century 


marked a significant shift from 


1965, when policymakers believed 


financial resources were at the 


core of educational achievement. 


None of these proposed reforms substantively chal- 
lenged the core framing of teaching as an output. Teach- 
ing could still be controlled through licensing, and once 
licensed, teachers would still act as a policy input, free to 
adapt strategies as they saw fit. In a 1998 interview, Leila 
Vickers, dean of the school of education at North Car- 
olina A & T State University, argued that teacher prepara- 
tion programs should revel in this uncertainty: 


I would like to see the state make policies that allow us 
teacher educators to develop a “pluralistic” approach to 
assessment. If we are going to have professional teachers 
model effective behaviors, to set high expectations for all 
students, we have to begin in the preparation program to 
say, “All individuals do not learn in the same way.” . . . We 


must employ strategies that use the modalities through 
which they learn.!! 


Despite maintaining an input view of teaching, the 1998 
reauthorization of the HEA was a fundamental departure 
in federal education policy. Teaching had been mentioned 
in a disorganized, catchall Title V since the act’s origin, 
but in 1998 Congress created a new Title II that focused 
exclusively on teacher quality. For the first time, the fed- 
eral government would require any college of education 
that received federal money to publicly report how its 
teacher preparation programs were assessed, to publish 
how many of its teaching candidates passed state licensure 
exams, and to construct their exams to measure “teacher 
candidate knowledge and skills.”!* This was a compro- 
mise, as Rep. Miller’s vigorous fight for federal teacher 
preparation standards ultimately failed. 


For the first time, the federal 
government would require any 


college of education that received 


federal money to publicly report 


how its teacher preparation 


programs were assessed 


Teacher Quality as an Individual Characteristic. Work 
in Congress on NCLB had already begun when HEA 
passed, and NCLB’s Title II standards for “highly quali- 
fied teachers” (HQT) were a direct outgrowth of the 
teacher preparation debate for HEA. Observers lamented 
that NCLB appeared to pay little attention to bolstering 
teacher quality—only $2.8 billion of the Department of 
Education’s $32 billion K-12 appropriation—but this 
time, Rep. Miller was instrumental in ratcheting up 
accountability for teacher quality.!> HEA demanded 
transparency from institutions responsible for preparing 
teachers but remained silent on how states and school dis- 
tricts were to address the existing inequitable distribution 
of teacher quality. In contrast, NCLB placed the onus on 
teachers by requiring that every teacher in schools receiv- 
ing Title I money be “highly qualified” by 2005-2006. 


Under NCLB, a highly qualified teacher was one who 
had a bachelor’s degree, full state certification, and 
demonstrated knowledge of his or her subject matter. The 
subject-matter requirement, drawn from the NCTAF 
report, was a small step toward requiring highly effective 
teachers, yet NCLB continued to assume that teachers 
were essentially equal in what they would bring to the 
classroom as long as they met HQT. Indeed, Stanford 
University professor Linda Darling-Hammond argued in 
the fall of 2001 that certification was the “one lever” that 
states had to improve teacher quality.'4 

NCLB’s dramatic approach to assessing student 
achievement—requiring all students to meet a fixed 
standard on a fixed timeline—ran interference for policy- 
makers who sought to reframe teacher quality in the same 
way. If schools and districts could be held accountable for 
their students’ academic performance, then colleges of 
education should be held equally accountable for their 
graduates’ performance. US Secretary of Education Rod 
Paige was an eager participant in hammering teacher 
preparation programs. Using data from HEA-required 
reports in 2002, he showed that only one state, Virginia, 
set a passing score even close to the national average for 
individual certification exams. (That is, in most states, an 
aspiring teacher could meet the requirement by scoring 
significantly below the fiftieth percentile.) The rest were 
below. The Education Trust thought he was too chari- 
table; a report from the group vigorously attacked states 
for providing “inconsistent, incomplete, and utterly 
incomprehensible” data.!° With a showing this bad, Paige 
argued, why should teacher candidates attend colleges of 
education at all? He suggested instead that neither teach- 
ers nor colleges of education should be trusted simply on 
the basis of their credentials or accreditation. Predictably, 
many schools of education and the NEA blasted the 
Department of Education’s report. But the shift in fram- 
ing toward measurement was undeniable, and the politi- 
cal tide had begun to shift. 

Although NCLB’s Title II could have been used to 
push merit pay, targeted professional development, or 
new rules for teacher tenure, Bush administration officials 
were hesitant to push for teacher effectiveness measures 
beyond those geared for colleges of education. First, 
research on increasing teacher effectiveness was thin. As 
late as 2005, authors of an American Educational Research 
Association report lamented that they could find no 
research linking teacher accreditation programs to student 
learning. Second—and more important politically—Title 
II funds were seen as the “consummate pot of local-con- 
trol money,” as later characterized by Secretary Margaret 


Spellings.!° Administration officials did not want to fur- 
ther antagonize the bipartisan chorus of state and local 
officials who had already decried NCLB for violating one 
of the last state-centric policy areas. 


As late as 2005, authors of an 
American Educational Research 
Association report lamented that 


they could find no research linking 


teacher accreditation programs 


to student learning. 


Nevertheless, Title II did unintentionally abet advo- 
cates of assessing the outputs of teaching. As states drafted 
HQT plans, it became apparent that they were circum- 
venting the spirit of HQT if not the letter itself. Law- 
makers knew that the 1998 HEA would affect only new 
teachers, so any significant improvements in the quality 
and distribution of the 2.9 million-strong teacher corps 
would be years away.!’ Eager to provide quicker improve- 
ments, lawmakers required all teachers to meet NCLB’s 
HQT definition, but states could use an alternative High, 
Objective, Uniform State Standard of Evaluation 
(HOUSSE) to help existing teachers satisfy HQT. 

As it turned out, HOUSSE proved to be neither 
uniform nor objective in many states. In 2004, New 
York’s HOUSSE allowed a teacher to provide evidence 
of either five graduate courses or the combination of 
supervising a student teacher and a bachelor’s degree in 
education. Georgia equated two professional develop- 
ment conferences with a doctoral degree. At the other 
end of the spectrum, Illinois required twenty-four credit 
hours of coursework in a teacher's content area.!8 In 
2005, Wisconsin claimed that 99.5 percent of its core 
classes were taught by teachers who met HQT; Massa- 
chusetts claimed 93 percent, Alabama 81.8 percent, and 
California 74 percent.!? In 2007, data from the states 
showed that HOUSSE still left low-income, nonwhite 
students with less experienced teachers: 13 percent of 
highly qualified teachers in schools with more than 75 
percent nonwhite students had fewer than three years’ 


experience. For schools with less than 25 percent non- 
white students, the figure was only 5 percent. High- 
poverty schools showed a similar disparity.”° 

These disparities were exacerbated by states’ slow 
progress toward the law’s 100 percent benchmark for 
2005-2006. To the chagrin of Rep. Miller and Sen. Ted 
Kennedy (D-MA), Secretary Spellings relented to pres- 
sure from teachers’ groups and state departments of edu- 
cation in early 2004 and weakened HQT.”! Now, 
teachers would only have to demonstrate “broad-field” 
competence (e.g., “science”) rather than a subject-specific 
test (e.g., physics). Also, states could design a single pro- 
cedure to test for competence rather than requiring mul- 
tiple assessments. By the official deadline, some forty 
states had not met the requirement, and political reality 
forced the Department of Education to grant an addi- 
tional year to those states making a “good-faith effort.””” 

Spellings recognized the tremendous loophole that 
HOUSSE had created and the incongruence of holding 
students accountable for performance while holding 
teachers accountable only for their qualifications. In May 
2006, the Department of Education announced that it 
would no longer allow states to use HOUSSE except in 
narrowly defined situations (rural teachers of multiple 
subjects in high school, special education teachers of mul- 
tiple subjects, and limited-term foreign teachers), with the 
support of Sen. Michael Castle (R-DE) and Miller. This 
time, Miller complimented Spellings for taking HQT 
“seriously.” But both the Council of Chief State School 
Officers and the NEA argued that Spellings had no legal 
authority to override the language of NCLB, and by fall 
2006, the department partially retreated from its plans. In 
a letter to chief state school officers, Spellings chastised 
the states for permitting teachers to use HOUSSE’s “sub- 
stantially less rigorous” standards to demonstrate compe- 
tency, but she revised the department’s earlier push by 
noting that it would seek to end HOUSSE in the reau- 
thorized bill rather than through the rule-making process.”4 
Teacher Quality as a Bipartisan Enterprise. A draft bill 
to reauthorize NCLB in 2007 continued to take small 
steps away from a focus on qualification toward a focus 
on quality. Similar to the 1998 HEA provisions, the dis- 
cussion draft of Title II circulated by Reps. Miller and 
Buck McKeon (R-CA) would have required states to 
report average teacher tenure and the number of first-year 
teachers, as well as disparities between high- and low- 
poverty schools.” 

While these and other attempts have died in Con- 
gress, President Barack Obama’s Department of Education 


has completed the federal transition from the conception 
of teaching as a credential-based policy input to an evalu- 
ated policy output. Instead of overturning the Bush-era 
accountability focus as some of Obama’s supporters 
hoped, Secretary Arne Duncan amplified it. First, despite 
strides made by colleges of education over the previous 
decade, Duncan still called teacher preparation programs 
in 2010 the “Bermuda Triangle of higher education . . . 
no one knows which students are succeeding as teachers, 
which are struggling, and what training was useful or 
not.””° Unlike Secretary Paige, who channeled similar 
criticisms into a charge for alternative certification pro- 
grams, Secretary Duncan pushed states to link student 
performance data to individual teachers and principals 
in the department’s Blueprint for Reform.7’ His views 
appeared in the 2009 stimulus bill in the form of require- 
ments for states to report teachers’ scores on certification 
tests upon entering teaching.”® Second, the Obama 
administration put teacher effectiveness at the center of 
educational equity in its goals for a reauthorized ESEA. 
Although the administration’s main push, Race to the 
Top (RIT), was touted as a departure from the punitive 
policies of NCLB, fully 28 percent of points in the RTT 
rubric were earmarked to ferret out effective teachers and 
principals. But the actual percentage was much higher 
because the rubric included verbiage in other sections 
about identifying “effective” teacher preparation pro- 
grams, “effective” instructional strategies, and “effective” 
schools identified by student-level data. By one calcula- 
tion, 53 percent of the rubric was tied to data-driven, 
“effective” education.” Teachers’ performance output 
would be monitored throughout their careers. 

Although some Democrats had been supporters of 
measuring teacher quality back in 2001, the renewed 
push to measure teacher effectiveness from the Obama 
administration seriously undercut teachers’ unions’ politi- 
cal firewall against the evaluation of teachers’ individual 
performance. In 2006, the NEA adopted an ambitious 
plan to overhaul NCLB after it failed to check the legisla- 
tion in court. The NEA recognized that it was politically 
“lonely,” in the words of an executive committee mem- 
ber, and argued that teachers had been left out in writing 
the original bill.*° Its proposal explicitly delinked teachers 
from student performance because “teaching is not an 
individual, isolated profession [but] . . . dependent on the 
entire network of teaching professionals.”*! It also sug- 
gested loosening the definition of “highly qualified” to 
include an academic minor in a subject and permitting 
one state’s definition of “highly qualified” to meet every 
other state’s definition. Yet, members of Congress charged 


with revising the law were hesitant to give the NEA a 
prime place at the table—both Republicans and Demo- 
crats on the committees classed the NEA as one of a 
“wide array” of stakeholders.*” 

Yet, by 2010, it became apparent to the leadership 
of the teachers’ unions that the federal government was 
not going to back down. In addition to zeroing in on 
teacher effectiveness, both President Obama and Secre- 
tary Duncan supported charter schools, school reconsti- 
tution, and versions of merit pay, all highly questionable 
from the unions’ perspective. This was not unnoticed. 
In a major shift, both the AFT and NEA acknowledged 
that teacher effectiveness could be measured. AFT presi- 
dent Randi Weingarten remarked: “Deliberately or not, 
President Obama, whom I supported, has shifted the 
focus from resources and innovation and collaboration 
to blaming it all on dedicated teachers.”*? Yet she sought 
to portray the AFT as open to measuring teacher effec- 
tiveness. In a speech to the National Press Club in Janu- 
ary 2010, Weingarten criticized standardized testing but 
agreed that for evaluating teachers, “Student test scores 
based on valid and reliable assessment should also be 
considered—xot by comparing the scores of last year's 
students with the scores of this year’s students, but by 
assessing whether a teacher's students show real growth 
while in his classroom.”** Although Weingarten indi- 
cated that teacher performance measures should also 
consider classroom observation, self-evaluations, and 
portfolio reviews, the admission of test scores to the list 
underscored a dramatic change from a decade earlier. 
For the AFT, teachers were individuals with distinct 
strengths and weaknesses. 

Yet the AFT has long been more open to experi- 
menting with education reform, so perhaps this change 
was less striking. The NEA faced a more difficult fight. 
At its July 2011 national convention, the union endorsed 
Obama for reelection, but its delegates were “appalled” 
by Secretary Duncan and the administration’ failure 
“to respect and honor the professionalism of teachers,” 
according to an adopted resolution.**> NEA president 
Dennis Van Roekel recognized that for the NEA to “get 
into that arena,” it would have to concede that student 
performance was part of teacher effectiveness.*° Though 
significantly weakened from an initial draft, delegates 
approved language acknowledging that student exams 
could be used to evaluate teacher effectiveness if they 
were “developmentally appropriate, scientifically valid, 
and reliable for the purpose of measuring both student 
learning and a teacher's performance.”*” Whether any 
exam would ever meet the NEA’s standard is uncertain, 
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but the organization no longer wanted to be kept outside 
the bargaining room when both parties, major think 
tanks, and many policy researchers suggested that teacher 
effectiveness was tightly linked to learning. 

Whatever the cause, once the national teachers’ 
unions admitted that teachers could be evaluated inde- 
pendently of their credentials, it was clear that teaching 
could no longer be undergirded with trust alone. Policy- 
makers across the spectrum demanded measurement. In 
2012, policymakers—and the unions—have moved to 
draft state accountability systems to include teacher per- 
formance. Under advice from the Department of Educa- 
tion, many states intend to evaluate teachers with up to 
half of their measure coming from student test scores.** 
When the Obama administration granted eleven states 
waivers from having to meet Adequate Yearly Progress 
(AYP) toward 100 percent student proficiency in early 
2012, it was in return for bolstering teacher effectiveness 
measures and increasing the number of students poten- 
tially covered by assessment systems.*? Although the 
push to evaluate the outputs of teaching did not alter 
existing law or even dominant practices, this new con- 
ception gave reform-minded policymakers political 
cover for substantive changes to the relationship among 
teachers, students, and schools. 


New Research, New Technology, 
New Frame 


The reimaging of teaching took a decade to percolate 
through the back rooms of major stakeholders in Ameri- 
can education policy, but it did not circulate simply 
because it was trendy, politically popular, or even particu- 
larly well thought out. Indeed, in 2001 teacher quality 
measures garnered very little attention in the initial 
debates over NCLB, were subject to strong pushback 
from teachers’ organizations, and generated a host of 
critics both for and against. 

Instead, the secular change from teacher-as-input to 
teaching-as-output was driven by two major external 
developments. First was the emergence of credible 
research in the late 1990s and early 2000s that challenged 
long-held assumptions about the relationship of licensure 
to quality. Second was that NCLB’s mandatory testing 
requirements created a treasure-trove of data on the indi- 
vidual student level that, though imperfect in many ways, 
created research and political pressure to take the next 
step and link student performance to teacher performance. 
These developments occurred in parallel but interacted in 


ways policymakers had not originally envisioned. New 
policies create new politics. 


This new conception gave 
reform-minded policymakers 


political cover for substantive 


changes to the relationship among 


teachers, students and schools 


The motivation for new research was driven by a 
combination of the Clinton-era “reinventing govern- 
ment” and educational excellence movements in the 
1990s. Under the guidance of Vice President Al Gore, 
the Government Performance and Results Act of 1993 
(GPRA) sought to “systematically hold . . . agencies 
accountable for achieving program results” because “waste 
and inefficiency in Federal programs undermine the con- 
fidence of the American people in the Government.” 
Although American education was not the focus of Gore's 
efforts, GPRA was a manifestation of the results-oriented 
policy space in which trust-based teaching could be 
undermined. As NAEP results weakened policymakers’ 
confidence in ESEAs original theory, seeking to hold 
schools “accountable for results” was a small step. The 
educational excellence movement provided the appropri- 
ate “results”: high (or at least improving) test scores. As 
such, a handful of education reformers and scholars in 
the early 1990s suggested that school choice and market- 
like incentives could revolutionize the quality of Ameri- 
can education, where “quality” meant “academic 
performance.”4! These reforms suggested that outsiders, 
whether state panels, parents, students, or voters, could 
evaluate school and teacher quality better than practition- 
ers themselves. At the time, there was little research to 
support or oppose the claims, and most arguments were 
thought experiments or analogies to private schools or 
desegregation. In 1996, Eric Hanushek characterized the 
situation as frustrating: “We know neither what forms of 
incentive systems are best nor what results we might 
expect . . . evaluation is central” but largely unavailable. 
The call for evaluation of American education found 
supporters in Congress, and the 1994 IASA expanded 
NAEP to measure students’ academic progress. This data 
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supplemented older data that had been repeatedly mined 
for hints about teacher effectiveness with little satisfac- 
tion. NAEP would provide contemporary, if cross- 
sectional, data. 


The Educational Testing Service 
report signaled that reformers could 


target specific teaching practices to 


improve learning instead of trying to 


attract better teacher candidates 


with higher pay. 


This newly available data became the foundation of 
studies questioning the long-standard claim by teachers’ 
unions that certified and highly paid teachers benefited 
students. In 2000, the Educational Testing Service 
released a study by Harold Wenglinsky suggesting that 
teachers’ content knowledge and specific pedagogical 
practices (such as frequent quizzes and hands-on instruc- 
tion) had a significant effect on students’ NAEP math 
and science scores. Wenglinsky found little relationship 
between those scores and either teacher pay, retention, or 
general professional development.*? Although the study's 
conclusions were weakened by the limitations of 1996 
NAEP data (they were cross-sectional rather than longitu- 
dinal; they had no reliable information about students’ 
socioeconomic status; they had no school-level informa- 
tion; and nonscore information was self-reported), the 
report signaled that reformers could target specific teaching 
practices to improve learning instead of trying to attract 
better teacher candidates with higher pay. Further, the 
study found that many traditional professional development 
programs—trun by colleges of education and teachers’ 
organizations—had little or even negative effects on math 
and science scores. 

Concerns about the variability of teaching moved 
beyond the closed world of education policy experts. The 
private-sector Teaching Commission, headed by former 
IBM CEO Lou V. Gerstner, recommended substantial 
changes to teacher preparation to enhance the quality of 
new teachers in 2004. Although the panel’s recommenda- 
tions were not unique, the high-profile nature of the 


commission suggested that the private sector would use 
data to analyze effectiveness. By middecade, outspoken 
Democrats in Congress, independent researchers, the 
US Department of Education, and the business sector 
had all weighed existing teacher quality policy and found 
it wanting. With that, efforts to replace trust with meas- 
urement became a political juggernaut. 

These reports questioned whether the black box of 
teacher quality was actually opaque, but they only fore- 
shadowed more critical reports built on the avalanche of 
data that NCLB generated, the second development lead- 
ing to the erosion of trust in credentials. NCLB’s AYP 
data were not intended to measure teachers, a fact that 
critics of the law are not shy to highlight, but the data 
were more detailed, more uniform, and more widely 
available than any previous educational achievement data. 
The temptation became too much to resist. The National 
Council on Teacher Quality produced a scathing prelimi- 
nary report in 2004 using state-provided data, but its 
conclusions were confirmed by a 2007 federal report 
using NCLB data, Teacher Quality under NCLB: An 
Interim Report. One of the lead researchers accused states 
of undermining teacher quality, saying, “The high com- 
pliance rate suggests there were states that set the bar low, 
and, in a way, grandfathered in a lot of teachers.”44 Both 
reports criticized HOUSSE, as had many others, but they 
delivered a damning blow because they used federal data. 
A decade earlier, researchers would have had to design a 
research protocol and synthesize disparate data. HEA and 
NCLB consolidated the data, bringing the policy issue 
into the sunlight. If HQT was not the strong medicine 
that some of its initial boosters had hoped to give to 
slow-moving states and colleges of education, policymak- 
ers could have written its failure off as another case study 
in the difficulty of reforming loosely coupled systems like 
schools, or why federal policy flounders in the face of 
limited state capacity.#? HQT did not itself contribute 
to reframing teaching, but the proliferation of student 
assessment in the wake of NCLB created new opportuni- 
ties for research to link teachers and students. That is, 
NCLB had not improved teacher quality in the way it 
had intended. HQT was meant to bolster trust in teach- 
ers ability by requiring more credentials, but HQT only 
accelerated the movement to measure teachers using data 
collected from students. 

Middecade, a series of reports indicated that teachers’ 
credentials had little to no value in predicting student 
achievement. Research completed in 2006 by Thomas 
Kane and his coauthors showed that math performance by 
students in classrooms with alternatively certified teachers 


was indistinguishable from that in classrooms with tradi- 
tionally certified teachers in the Los Angeles Unified 
School District (Kane’s research is particularly notable 
because AFT president Randi Weingarten singled him out 
as a trustworthy researcher in a 2010 address).46 Hanushek 
and his colleagues found similar noneffects of certification 
and master’s degree requirements in Texas.*” Likewise, in 
2007, Robert Pianta and his coauthors found wide varia- 
tion in student performance in a multistate study of mid- 
dle-class students—so wide that the findings “did not 
appear congruent with the high performance standards 
expected for students or for teachers as described by most 
state teacher certification and licensure documents.”4° Fur- 
ther, outside the academic world, NCLB’s student data 
continued to show classroom-level gaps between white and 
nonwhite student groups—even in states such as Wiscon- 
sin, Montana, Oklahoma, and Connecticut that reported 
upwards of 98 percent highly qualified teachers. By the 
end of the decade, policymakers had an indisputable 
research base showing that teacher licensure itself, used to 
justify treating teachers as equivalent inputs, was unrelated 
to teachers’ effectiveness in the classroom. The “one lever” 
Darling-Hammond said states had over the teaching work- 
force turned out to be futile tinkering toward utopia. 

As research confirmed the political reframing of 
teaching, critics of NCLB’s absolute benchmark for AYP 
sought to replace it with a student- or classroom-based 
value-added measure. Value-added measures were some- 
thing of a halfway covenant between the US Department 
of Education and practitioners, and they have generated 
lukewarm interest in teachers’ union affiliates.” Value- 
added models gave the Department of Education a politi- 
cal “out” when it became clear that a large number of 
schools and districts would be identified as “in need of 
improvement” using the fixed-target AYP. In 2005, Secre- 
tary Spellings allowed nine states to pilot value-added 
models of academic growth. A study that year by the 
National Association of State Boards of Education praised 
the decision, saying that the change “moveld] the discus- 
sion about teacher quality to where it belongs: centered 
on increasing student learning as the primary goal of 
teaching.”°° The Obama administration made value- 
added models a de facto policy in 2009 by requiring RTT 
applicants to use them to measure student achievement, a 
move that Secretary Duncan later said would “[hold] us 
all accountable for the quality of education we provide to 
every single student in America.”*! For teachers’ union 
affiliates, value-added approaches have been embraced as a 
way to mentor new teachers and identify students need- 
ing extra help.>* In a simultaneous effort beginning in 


2005, the Data Quality Campaign began to push states to 
create data systems to track individual student scores to 
make value-added evaluation feasible. 

Yet for both the administration and unions, the shift 
to value-added models middecade acknowledged the 
changing framing of teacher quality. The fixed AYP 
standard implied that students should meet the standard 
regardless of their teachers, but value-added standards sug- 
gest that different students may respond differently to 
different teachers. Teachers are then not interchangeable 
but individuals who differ in their effectiveness. 


The proliferation of student 


assessment in the wake of NCLB 


created new opportunities for 


research to link teachers 


and students. 


Elixirs and Effectiveness: Finding 
Quality Teachers 


The reframing of teaching in the 2000s opened a policy 
window that had been closed a generation before by 
ESEA%s intensive focus on school resources. Policymakers 
do not talk about teacher quality in 2012 the same way 
they did in 2001. Today, the debate is better informed by 
research, better linked to student learning, and thoroughly 
national. For policymakers, the central takeaway message 
from the decade is that the next generation of teacher qual- 
ity must be tightly targeted to improving results. No seri- 
ous policymaker now suggests that requiring more hours 
of general professional development will improve student 
outcomes, and many of them doubt that traditional 
teacher certification is a marker of quality. Sensing the 
opportunity for a dramatic overhaul of teaching, policy 
entrepreneurs swept into the legislative marketplace with 
elixirs to remedy American education through teaching 
reform. Yet California governor Jerry Brown singled out 
“experts and academics and foundation consultants” for 
their exuberance in January 2012, saying, “it is salutary 
and even edifying that so much interest is shown in the 
next generation, [but] in a state with six million students, 
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300,000 teachers, deep economic divisions, and a hun- 
dred different languages, some humility is called for.”° 
As with California, so with the nation. None of these 
medicines can cure the patient in the way their boosters 
suggest, but each represents a small, positive step toward 
understanding teachers as individuals rather than as an 
undifferentiated class of employees. In broad strokes, pol- 
icy entrepreneurs propose to remedy low teacher quality 
by addressing teacher preparation, teacher recruitment, or 
teacher tenure. 


Teacher Preparation. One elixir, and the least disruptive 
to the status quo, suggests that overhauling teacher educa- 
tion is the key to closing the teacher quality gap. Propo- 
nents acknowledge that colleges of education produce 
teachers of widely varying skill but prefer to emphasize 
improving “skills” of teachers rather than rating them on 
student results. According to Darling-Hammond, “Devel- 
oping more-skillful teaching is a sine qua non for attaining 
higher and more equitable achievement for students in the 
United States.” °4 Darling-Hammond and others suggest 
that colleges of education develop residency-like programs 
to minimize lower-quality teaching that is characteristic of 
first- and second-year teachers. The residency model got a 
boost from Secretary Duncan in 2012 when he suggested 
that states could require evidence of effective teaching 
before granting a license.®* One critic of the approach con- 
ceded that “no one is even against high-stakes evaluation” 
anymore.*° Others have suggested increasing content-spe- 
cific courses. Along these lines, the National Council for 
Accreditation of Teacher Education is boosting standards 
for teacher education programs. Colleges of education 
have been slow to report on their actual practices, and 
officials are worried about how a residency-like model 
would affect the cost and course structure of college. Still, 
these institutions know they remain under the microscope 
for improving student performance. 

A second elixir some propose is to require existing 
teachers to gain advanced credentialing, especially 
National Board certification. Unlike traditional teacher 
preparation programs, National Board certification is inten- 
sive and content-based, and it requires reflection on peda- 
gogy. It also has a rigorous exam (the pass rate for the 
Praxis II certification exam in a given year is approxi- 
mately 90 percent, while the first-time National Board 
pass rate is near 50 percent).°” Districts have used. National 
Board certification as a distinction for teacher quality. 
Between its inception in 1993 and 2011, 97,281 teachers 
had become so certified, although 48 percent of these 
are in Florida, North Carolina, South Carolina, and 


Washington, states that give financial bonuses for certifi- 
cation.°® As a boost to teacher effectiveness, this reform 
shows mixed results. It does appear that National Board 
certification is a fair indicator of quality teaching. Two 
studies of North Carolina teachers with board certifica- 
tion, published in 2007 and 2010, found statistically 
significant improvements in student test scores versus 
non—board certified teachers, although not necessarily 
uniformly across the curriculum. In particular, board cer- 
tification seemed a better marker in lower grades and for 
low-income students.” Yet as a way to improve existing 
teachers’ effectiveness, National Board certification 
appears to have little to offer. One of the studies just 
mentioned found that National Board teachers were no 
more effective after receiving the certification than before, 
and a 2009 study of Florida teachers found no consistent 
benefits for any grade or student exam.°° 


By the end of the decade, policy- 


makers had an indisputable research 


base showing that teacher licensure 
itself was unrelated to teachers’ 


effectiveness in the classroom. 


Teacher Recruitment. Other policy entrepreneurs suggest 
that the root problem of low teacher effectiveness is not a 
problem of preparation but of supply. Traditionally certi- 
fied teachers tend to have lower academic scores in col- 
lege than graduates as a whole, and some academic areas 
are routinely difficult to fill, especially math, engineering, 
and natural sciences.°! The solution, according to these 
entrepreneurs, is to make teaching more attractive to 
potentially excellent teachers who forgo traditional 
teacher preparation because the opportunity cost of an 
education degree is too high. 

Alternative certification would bring in new, high- 
quality teachers with real-world experience or a passion for 
teaching. Because research suggests that general pedagogical 
courses or professional development have little impact on 
teaching effectiveness, alternative certification could bolster 
the ranks of content-expert teachers or high-performing 
college graduates who would not otherwise have considered 
teaching—especially in low-performing, low-income 


districts. The most highly publicized alternative to tradi- 
tional teacher preparation is Teach For America (TFA), 
a nonprofit organization formed in 1990 to address 
teacher shortages in low-income schools. TFA’s “corps 
members” make an explicit two-year commitment to a 
designated school. Surveys of TFA teachers suggest that 
about 60 percent of them remain in teaching for more 
than two years and 28 percent over five.°? Those num- 
bers, however, are not for teachers remaining in their origi- 
nal school, and other studies of TFA suggest very high 
turnover rates. Using data from 2003 to 2008, for exam- 
ple, 79 percent of TFA teachers had left the New York 
City public schools after four years compared with 31 per- 
cent of new but traditionally certified teachers.© As a 
solution to teacher shortages, then, TFA does not appear 
promising. Turnover is problematic because it creates a 
school environment with little community, increases per- 
sonnel costs as districts seek to replace teachers, and poten- 
tially weakens student achievement as new teachers with 
little experience replace departing ones. 

Alternative certification does not necessarily assume 
that teachers will enter a lifetime of service. Consistent with 
the new framing of quality, alternatively certified teachers 
are expected to be effective—as measured by outside review- 
ers. TFA teachers are unquestionably highly qualified in 
terms of degrees and college entrance exam scores; the 
same New York study found that high school TFA math 
teachers scored 110 points higher on SAT math than tradi- 
tionally certified high school math teachers. Whether that 
academic prowess translates into classroom success is an 
elusive answer, however. Louisiana TFA teachers appear to 
consistently out-perform similarly experienced non-TFA 
teachers and hold their own with more experienced teach- 
ers, but Houston teachers produce comparable or worse 
results than non-TFA teachers. New York TFA teachers 
produce results slightly better than comparably experienced 
teachers in some cases and worse in others—but generally 
better than other alternatively certified teachers.°° And 
therein lies the rub: unless schools and districts can over- 
come high turnover among alternatively certified teachers, 
alternative certification is unlikely to transform American 
education in the neediest of districts. 


Teacher Tenure. A last group of policy entrepreneurs 
argue that student performance should directly reshape 
the teacher workforce by calling into question the 
assumptions of traditional teacher tenure. And, unlike 
either teacher training or teacher recruitment, workforce 
policies seek to overhaul the existing teacher corps in pur- 
suit of effective teaching outputs. Critics of traditional 


teacher tenure argue that treating similarly qualified 
teachers as equal denies voters the right to hold teachers 
accountable for their performance—and by extension, 
their use of taxes. There are two basic approaches to 
reforming tenure. The first, differential pay, severs the 
link between years of experience and pay. The second 
abolishes tenure entirely and ties teachers’ employment to 
demonstrated effective teaching. 

Proponents of differential pay, also known as merit 
pay, assume that teachers respond to financial incentives. 
The assumption itself is not controversial: teachers’ 
unions have argued since their inception that higher pay 
will increase the quality of teachers and that the level of 
teacher pay discourages potential teachers. One researcher 
estimated that a 10 percent increase in potential non- 
teaching wages reduced the likelihood that the top quar- 
ter of female college students would go into teaching by 
6.4 percent. The same 10 percent increase only reduced 
teaching’s attractiveness by 3.7 percent for women in 
the twenty-fifth to fiftieth percentile.” Yet the dominant 
approach to teacher pay works against using pay as an 
incentive. Most districts pay teachers based on years of 
experience (step) and their level of education (lane). 
Experience and education would be appropriate measures 
of quality if policymakers still understood teaching as a 
black box because both measures reward teachers for 
dedication to their careers; that may in fact help teacher 
supply. But research has amply demonstrated that educa- 
tional credentials have no correlation with teacher quality, 
and student achievement is most sensitive only to the first 
few years of a teacher's career, thereby undermining the 
logic of rewarding teachers solely for their credentials 
and experience. 

Merit pay addresses this disconnect. Instead of 
rewarding teachers for years of service, merit pay offers 
higher salaries or bonuses based on demonstrated student 
performance. Incentives like this have suggestive results. A 
first evaluation of Denver's ProComp program found that 
students with new teachers had consistently higher scores 
than those with Denver teachers overall. High-poverty 
schools were also able to retain more of their new teach- 
ers, although they still had higher turnover than suburban 
schools.®* A competing study in Nashville, however, 
found that simply offering bonuses had no significant 
effect on teacher performance, even bonuses of $15,000. 
Incidentally, the Denver study found similar results for 
existing teachers who opted into the merit system. The 
results of these studies suggest that merit pay can, in fact, 
improve recruitment of higher-quality teachers, but offer- 
ing bonuses to existing teachers by itself will do little to 
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refocus teachers’ existing practice. 

Most dramatically, a second group of policy entre- 
preneurs suggests that students’ short time horizon in 
school dictates that school districts should fire or reassign 
poorly performing teachers now. Although scholars con- 
tinue to struggle with identifying “good” teachers, princi- 
pals and other teachers have little problem reliably 
identifying poor teachers in their schools. This approach 
is most visibly associated with Michelle Rhee’s IMPACT 
teacher evaluation system in the District of Columbia 
Public Schools, but less publicized variations have been 
tried in other districts, including the Houston Independ- 
ent School District and the Hamilton County (Ten- 
nessee) school district.’° Initial evaluations of laying off 
the least effective teachers indicate not only that student 
achievement would improve but also that districts would 
ultimately lay off fewer personnel than under a seniority- 
based layoff system.”! In the short term, firing the worst 
teachers may produce strong, positive results, but the 
approach still leaves unaddressed why some teachers enter 
the system ill prepared. Hiring teachers only to fire them 
shortly thereafter is a destabilizing, expensive process for 
schools that, in the long term, could undermine short- 
term gains. 


Conclusion 


When James Coleman and his coauthors released EEO in 
1966, they despaired about the ability of schooling to rec- 
tify pervasive inequity in American education. Yet, since 
then, researchers have confirmed that teaching is a central 
driver of student achievement. In the last fifteen years, 
policymakers have moved from trusting teachers’ creden- 
tials to measuring teaching to verify success in the class- 
room. New data and better research have opened a policy 
window for creative and meaningful changes to teacher 
accountability. All of the top-line reforms are plagued by 
uncertainty—When can good teachers be identified? 
When is experience more important than training? How 
much should student achievement contribute to a teacher's 
evaluation?—but all of the questions address the core 
enterprise of schools: student learning. 

The change from trust to measurement occurred not 
so much from a failure of teachers but from a recognition 
that teacher education programs were cookie-cutter solu- 
tions for infinitely variable education problems. Identical 
credentials granted at the beginning of teachers’ careers 
cannot adequately predict how well teachers will fare in 
widely and predictably different classrooms. Teachers, 


too, are not uniform individuals, a fact that even the 
stepped credentialing popular in the 1990s hid. Measure- 
ment allows policymakers to remedy poor teacher per- 
formance for students’ sakes and to place teachers in 
classrooms where they can expect to thrive. 


For policymakers, the central take- 


away message from the decade is 


that the next generation of teacher 
quality must be tightly targeted 


to improving results. 


Nevertheless, the small steps of the 2000s toward 
the next generation of teacher quality face technological 
and political difficulties going forward. Technologically, 
remote instruction has challenged the connection 
between teaching and learning. Unlike the cable televi- 
sion instruction promoted by some states in the 1980s, 
contemporary Internet-based education can provide self- 
paced learning for students who are comfortable with— 
and expect— interactions with computers. As distance 
learning expands in K-12 education, the link between 
student performance and teacher quality will become 
increasingly tenuous. For students, the advantage of 
remote education is that it can be tailored much better 
to their own education level and needs because students 
interact with the curriculum independently and individu- 
ally. The quality of online, “content” teachers could be 
measured minutely by tracking the time it takes remote 
teachers to respond to questions, the number of attempts 
students take on any particular assignment, or even the 
number of interactions a teacher has with a particular 
student. On-site teachers will provide less primary instruc- 
tion and more task management for students; the quali- 
ties of these teachers will differ from those needed by 
“content” teachers. Assessment of student learning will be 
harder to appropriately assign. 

Politically, the bipartisan consensus around teacher 
quality may be challenged by renewed concerns about 
American federalism. Although state governments eagerly 
adopted the Common Core Standards Initiative, their 
eagerness sprang at least as much from a desire to defuse 
federal efforts as from a desire to create uniform stand- 
ards. The Obama administration has not been shy about 
using federal power to trump state policymakers in 
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immigration, health care, and education policy. To the 
extent Republicans (and state-level Democrats) character- 
ize teacher quality initiatives as intrusive federal policy, 
the next generation of teacher quality may become a 
political albatross for its supporters in the states. 

Despite these challenges, the next generation of 
teacher quality will continue to positively reshape Ameri- 
can education. In a world that demands economic 
competitiveness both locally and internationally, the pre- 
sumption that all teachers are equal in talent thwarts the 
American dream of equal educational opportunity. For 
teachers, the de facto loss of trust in credentials suggests 
that they must place greater emphasis on student learn- 
ing. This does not need to be a negative thing. Although 
Title II does not seem to have dramatically boosted 
teacher quality in the way its early supporters hoped, 
schools and school districts have had to think about 
teacher effectiveness in a new way. Teachers may endure 
greater scrutiny in their classrooms, but that may also 
mean that they will receive better, individual, targeted 
pedagogical training, consistent and meaningful evaluation 
of their practice, and better understanding of the use of 
evaluation data. If so, they will benefit from these changes. 

For students, especially low-performing students, 
policymakers’ focus on the results of teaching has 
improved their chances of receiving a better education. 
Untargeted spending is politically easy and often explic- 
itly supported by teachers’ unions, but it perpetuates 
inequality as teachers are rewarded for vot taking chal- 
lenges as much as they are for taking challenges. Boosting 
teacher quality through targeted reforms will not cure 
American education once for all, but it will reward teach- 
ers who take challenges and succeed at them—because 
their students will, too. 
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